Prediction of word prominence
نویسندگان
چکیده
Control of prosody is essential for the synthesis of natural sounding speech. Text-to-speech systems tend to accent too many words when taking into account only the distinction between open-class and closed-class words. In the prominence-based approach [1], the degree of accentuation of a syllable is described in terms of a gradual prominence parameter. This paper presents the calculation of the prominence level of words based on their word class, the classes of the surrounding words, and their position in a clause. Rules predicting word prominence are derived from statistical analysis of a prosodic database. The hand-crafted rules are compared with the results of several machine learning algorithms on the same material. Furthermore, a perceptual test and an analysis of the resulting speech signals are carried out.
منابع مشابه
Identifying prosodic prominence patterns for English text-to-speech synthesis
This thesis proposes to improve and enrich the expressiveness of English Textto-Speech (TTS) synthesis by identifying and generating natural patterns of prosodic prominence. In most state-of-the-art TTS systems the prediction from text of prosodic prominence relations between words in an utterance relies on features that very loosely account for the combined effects of syntax, semantics, word i...
متن کاملEvaluating Metrical Phonology - a Computational- Empirical Approach
This study aims at providing an empirical basis for the evaluation of predictions made by metrical phonology. The predictions are compared to perceptual syllable prominence annotated in a database of German read speech. An evaluation baseline was defined on the correlation between prominence ratings for different speakers. Then, different sets of rules for prominence prediction were tested and ...
متن کاملAcoustical and Lexical/syntactic Features to Predict Prominence
In this study acoustical as well as lexical/syntactic correlates of prominence are analyzed and discussed. Prominence is defined at the word level and is based on listener judgments. Spoken sentences from many different speakers, taken from the Dutch Polyphone corpus of telephone speech, are analyzed. A selection of useful acoustical input features is chosen for classification of word prominenc...
متن کاملUp to what level can acoustical and textual features predict prominence
In this paper both acoustical as well as textual correlates of prominence are discussed. Prominence, as we use it, is defined at the word level and is based on listener judgments. A selection of useful acoustic input features is tested for classification of prominent words, with the help of Feed Forward Nets. We use spoken sentences from many different speakers, taken from the Dutch Polyphone c...
متن کاملA First Glimpse of Kanakanavu Word Prominence
This study investigated the word prominence pattern of Kanakanavu, a critically endangered Austronesian language spoken in Taiwan. Previous studies on the phonetic correlates of Piwan and Saisiyat agreed that pitch is the only consistent cue, indicating that Formosan languages are more like pitchaccent languages. However, given that word accents are in a fixed position for those two languages, ...
متن کامل